CDS

Accession Number TCMCG004C60141
gbkey CDS
Protein Id XP_025644522.1
Location complement(join(50424725..50424762,50425170..50425305,50425783..50425914,50426112..50426376,50426939..50427012,50427177..50427292,50427884..50428214))
Gene LOC112738342
GeneID 112738342
Organism Arachis hypogaea

Protein

Length 363aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA476953
db_source XM_025788737.2
Definition thiol protease aleurain [Arachis hypogaea]

EGGNOG-MAPPER Annotation

COG_category O
Description Belongs to the peptidase C1 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K01366        [VIEW IN KEGG]
EC 3.4.22.16        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04142        [VIEW IN KEGG]
ko04210        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
map04210        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCGTGCTCATCGTCAACGCCGGCGGCGTGGTGGTTCACCATCGCGGTATTGTTCTTTGTCGTCGCCGCCGCATCGGCCGGGTGGAGCTCCGATGATCCGAACCCAATCCGAATGGTGCCGGACGAGCTCCGAGAAGTGGAGGCGGAGGTGGTTAGGGTCGTCGGGCGAACCCGGCACGCTCTCTCTTTCGCTCGGTTCGCCGTCAGGCATGGAAAACGATATGAGAGTCCCGAAGAGCTGAAGATGCGATTTGAAGTGTTCTCTGAGAACAAGAGGCTCATAAGATCTACTAACAGAAAGCGATTGTCGTACACTCTCGCCGTTAACCATTTTGCTGATTGGACTTGGGAGGAGTTCAAAAGACACAGACTAGGCGCAGCACAAAATTGCTCTGCTACCCTTAAGGGCAATCATAAGCTTACTGAAGCTGTTCTTCCTGAGACGCAAGACTGGAGAAAAGAAGGTATTGTTAGCCCAGTCAAAGATCAAGGCTCCTGTGGATCTTGCTGGACATTCAGCACAACTGGAGCTTTGGAAGCAGCCTATGCACAAGCGTTTGGAAAGAGCATCTCTCTTTCTGAGCAGCAGCTAGTGGATTGTGCTGGTGCTTTCAATAACTATGGTTGTAATGGTGGGTTGCCATCCCAAGCTTTTGAATACATCAAATACAGTGGTGGACTTGACTCAGAGGAAGCATATCCCTATACCGCGAAGAATGGTGTCTGCAAATTCAATGCTGAAAATGTTGCTGTTCAAGTCCTTGACTCTGTCAATATTACTTTGGGTTCTGAGGATGAATTAAAGCATGCAGTTGCTTTTGTTCGGCCAGTTAGTGTGGCATTTCAGGTGGTTGATGGTTTCCGATTCTATAAGGATGGTGTTTACACTAGTAACACTTGCGGTAGCACATCCCAGGATGTAAACCATGCTGTTCTCGCTGTTGGGTATGGTGTTGAAAATGGTGTCCCATATTGGCTTATTAAAAATTCATGGGGAAAATCATGGGGTGACGATGGTTACTTCAAGATGGAGCTGGGGAAGAATATGTGCGGTGTTGCTACTTGTGCCTCGTATCCAATTGTGGCTTAG
Protein:  
MACSSSTPAAWWFTIAVLFFVVAAASAGWSSDDPNPIRMVPDELREVEAEVVRVVGRTRHALSFARFAVRHGKRYESPEELKMRFEVFSENKRLIRSTNRKRLSYTLAVNHFADWTWEEFKRHRLGAAQNCSATLKGNHKLTEAVLPETQDWRKEGIVSPVKDQGSCGSCWTFSTTGALEAAYAQAFGKSISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKYSGGLDSEEAYPYTAKNGVCKFNAENVAVQVLDSVNITLGSEDELKHAVAFVRPVSVAFQVVDGFRFYKDGVYTSNTCGSTSQDVNHAVLAVGYGVENGVPYWLIKNSWGKSWGDDGYFKMELGKNMCGVATCASYPIVA